Towards a neurocomputational model of speech production and perception

نویسندگان

  • Bernd J. Kröger
  • Jim Kannampuzha
  • Christiane Neuschaefer-Rube
چکیده

The limitation in performance of current speech synthesis and speech recognition systems may result from the fact that these systems are not designed with respect to the human neural processes of speech production and perception. A neurocomputational model of speech production and perception is introduced which is organized with respect to human neural processes of speech production and perception. The production–perception model comprises an artificial computer-implemented vocal tract as a front-end module, which is capable of generating articulatory speech movements and acoustic speech signals. The structure of the production–perception model comprises motor and sensory processing pathways. Speech knowledge is collected during training stages which imitate early stages of speech acquisition. This knowledge is stored in artificial self-organizing maps. The current neurocomputational model is capable of producing and perceiving vowels, VC-, and CV-syllables (V = vowels and C = voiced plosives). Basic features of natural speech production and perception are predicted from this model in a straight forward way: Production of speech items is feedforward and feedback controlled and phoneme realizations vary within perceptually defined regions. Perception is less categorical in the case of vowels in comparison to consonants. Due to its human-like production–perception processing the model should be discussed as a basic module for more technical relevant approaches for high-quality speech synthesis and for high performance speech recognition. 2008 Elsevier B.V. All rights reserved.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Performing Identification and Discrimination Experiments for Vowels and Voiced Plosives by Using a Neurocomputational Model of Speech Production and Perception

A neurocomputational model of speech production and speech perception is introduced. After training, i.e. after mimicking early phases of speech acquisition, the model is capable of producing and perceiving vowels and CV-syllables (C = voiced plosives). Different instances of the model were trained for representing different “virtual subjects” which are then used as listeners in identification ...

متن کامل

Constructing Cerebellum Model by Researching on its Contributions to DIVA

DIVA (Directions into Velocities of Articulators) is a mathematical model of the processes behind speech acquisition and production, supposed to achieve a functional representation of areas in the brain that are involved in speech production and speech perception. Introducing cerebellum control mechanism into the model plays a significant role in improving the mechanism of speech acquisition an...

متن کامل

Relationship between Working Memory, Auditory Perception and Speech Intelligibility in Cochlear Implanted Children of Elementary School

Objectives: This study examined the relationship between working and short-term memory performance, and their effects on cochlear implant outcomes (speech perception and speech production) in cochlear implanted children aged 7-13 years. The study also compared the memory performance of cochlear implanted children with their normal hearing peers. Methods: Thirty-one cochlear impl...

متن کامل

Towards a Contrastive Pragmatic Analysis of Congratulation Speech Act in Persian and English

This paper aims at studying the speech act of congratulation in Persian and English with regard to semantic formulas. To gather the semantic formulas related to congratulation, the researchers chose 100 movies (50 in Persian and 50 in English) as the instrument of the study. The only model of cross-cultural comparison was related to that of Elwood (2004). Therefore, we used Elwood’s model as th...

متن کامل

The integration of large-scale neural network modeling and functional brain imaging in speech motor control

Speech production demands a number of integrated processing stages. The system must encode the speech motor programs that command movement trajectories of the articulators and monitor transient spatiotemporal variations in auditory and somatosensory feedback. Early models of this system proposed that independent neural regions perform specialized speech processes. As technology advanced, neuroi...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • Speech Communication

دوره 51  شماره 

صفحات  -

تاریخ انتشار 2009